Moroccan Data-Driven Spelling Normalization Using Character Neural Embedding

Similar resources

Handwriting Moroccan regions recognition using Tifinagh character

The territorial organization of Morocco under the administrative division of 2009 is based on 16 regions. In this work we create a system for recognizing handwritten words (names of regions) written in the Amazigh language, an official language according to the Moroccan Royal Institute of Amazigh Culture (IRCAM) (2003a) [1]; this language has received little attention from researchers in pattern recognition...

Data-driven Schema Normalization

Ensuring Boyce-Codd Normal Form (BCNF) is the most popular way to remove redundancy and anomalies from datasets. Normalization to BCNF forces functional dependencies (FDs) into keys and foreign keys, which eliminates duplicate values and makes data constraints explicit. Despite being well researched in theory, converting the schema of an existing dataset into BCNF is still a complex, manual tas...

Applying Data-Driven Normalization Strategies for qPCR Data Using Bioconductor

High-throughput real-time quantitative reverse transcriptase polymerase chain reaction (qPCR) is a widely used technique in experiments where expression patterns of genes are to be profiled. qPCR is widely accepted as the "gold standard" for analysis of gene expression. Recent technological advances have greatly expanded the total number of genes that can be analyzed in a single assay; qPCR exp...

Data-Driven Spelling Correction using Weighted Finite-State Methods

This paper presents two systems for spelling correction formulated as a sequence labeling task. One of the systems is an unstructured classifier and the other one is structured. Both systems are implemented using weighted finite-state methods. The structured system delivers state-of-the-art results on the task of tweet normalization when compared with the recent AliSeTra system introduced by Ege...

Data Embedding for Camera-Based Character Recognition

This paper investigates embedding class information into each character image, with the aim of making camera-based character recognition as easy and accurate as bar-code reading. Each character image is printed with a horizontal stripe pattern, called a cross ratio pattern, and the class information is represented as a cross ratio derived from the pattern. Since the cross ratio is invariant to projec...

Journal

Journal title: Vietnam Journal of Computer Science

Year: 2020

ISSN: 2196-8888, 2196-8896

DOI: 10.1142/s2196888821500044